Model Assessment with ROC Curves
نویسنده
چکیده
Introduction Classification models and in particular binary classification models are ubiquitous in many branches of science and business. Consider, for example, classification models in bioinformatics that classify catalytic protein structures as being in an active or inactive conformation. As an example from the field of medical informatics we might consider a classification model that, given the parameters of a tumor, will classify it as malignant or benign. Finally, a classification model in a bank might be used to tell the difference between a legal and a fraudulent transaction. this is accomplished by using metrics derived from the confusion matrix or contingency table. However, it has been recognized that (a) a scalar is a poor summary for the performance of a model in particular when deploying non-parametric models such as artificial neural networks or decision trees (Provost, Fawcett, & Kohavi, 1998) and (b) some performance metrics derived from the confusion matrix are sensitive to data anomalies such as class skew (Fawcett & Flach, 2005). Recently it has been observed that Receiver Operating Characteristic (ROC) curves visually convey the same information as the confusion matrix in a much more intuitive and robust fashion (Swets, Dawes, & Monahan, 2000). Here we take a look at model performance metrics derived from the confusion matrix. We highlight their shortcomings and illustrate how ROC curves can be deployed for model assessment in order to provide a much deeper and perhaps more intuitive analysis of the models. We also briefly address the problem of model selection.
منابع مشابه
ارزیابی حساسیتپذیری فرسایش آبکندی با استفاده از رگرسیون لجستیک، در حوضه صلواتآباد استان کردستان
Introduction Gully is one of the forms of water erosion features in many regions of the world. Gully erosion could produce abundant sediment load, reducing the fertility of land and destruction of structures. In literature, for the preparation of landslide susceptibility mapping, several studies have been conducted using logistic regression. But, a few authors focused on gully erosion suscep...
متن کاملComparing ANN and CART to Model Multiple Land Use Changes: A Case Study of Sari and Ghaem-Shahr Cities in Iran
Most of the land use change modelers have used to model binary land use change rather than multiple land use changes. As a first objective of this study, we compared two well-known LUC models, called classification and regression tree (CART) and artificial neural network (ANN) from two groups of data mining tools, global parametric and local non-parametric models, to model multiple LUCs. The ca...
متن کاملROC Curves for Steganalysts
There are different approaches in the literature for the assessment of steganographic algorithms and steganalytic attacks. In the early papers it was considered sufficient to show the existence of an effect for one or a few examples only. The more the area of steganography evolved, the more diverse became the goals and the harder to measure the improvements. Many branches of science are facing ...
متن کاملMixtures of receiver operating characteristic curves.
RATIONALE AND OBJECTIVES Receiver operating characteristic (ROC) curves are ubiquitous in the analysis of imaging metrics as markers of both diagnosis and prognosis. While empirical estimation of ROC curves remains the most popular method, there are several reasons to consider smooth estimates based on a parametric model. MATERIALS AND METHODS A mixture model is considered for modeling the di...
متن کاملSymmetry Properties of Bi-Normal and Bi-Gamma Receiver Operating Characteristic Curves are Described by Kullback-Leibler Divergences
Receiver operating characteristic (ROC) curves have application in analysis of the performance of diagnostic indicators used in the assessment of disease risk in clinical and veterinary medicine and in crop protection. For a binary indicator, an ROC curve summarizes the two distributions of risk scores obtained by retrospectively categorizing subjects as cases or controls using a gold standard....
متن کامل